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Abstract — Minimum mean square error (MMSE) estimation of 
block sparse signals from noisy linear measurements is consid- 
ered. Unlike in the standard compressive sensing setup where the 
non-zero entries of the signal are independently and uniformly 
distributed across the vector of interest, the information bearing 
components appear here in large mutually dependent clusters. 
Using the replica method from statistical physics, we derive a 
simple closed-form solution for the MMSE obtained by the 
optimum estimator. We show that the MMSE is a version of 
the Tse-Hanly formula with system load and MSE scaled by 
parameters that depend on the sparsity pattern of the source. 
It turns out that this is equal to the MSE obtained by a genie- 
aided MMSE estimator which is informed in advance about the 
exact locations of the non-zero blocks. The asymptotic results 
obtained by the non-rigorous replica method are found to have 
an excellent agreement with finite sized numerical simulations. 



I. Introduction 

Compressive sensing (CS) |jT|, Q tackles the problem of 
recovering a high-dimensional sparse vector from a set of 
linear measurements. Typically the number of observations is 
much less than the number of elements in the vector of interest, 
making naive reconstruction attempts inefficient. In addition to 
being an under-determined problem, the measurements may 
suffer from additive noise. Under such conditions, the signal 
model for the noisy CS measurements can be written as 

y = Ax + neR^'\ (1) 

where x E is the sparse vector of interest, A E M^^^^ the 
measurement matrix, and n E M*^ represents the measurement 
noise. By assumption, M < N and only some of the elements 
of X are non-zero. The task of CS is then to infer x, given 
A, y and possibly some information about the sparsity of x 
and the statistics of the noise n. 

In this paper, the vector x is assumed to have a special block 
sparse structure. Such sparsity patterns have recently been 
found, e.g., in multiband signals and multipath communication 
channels (see, e.g., |j3)-||6j and references therein). More 
precisely, the source is considered to be K block sparse so 
that for any realization of x, its information bearing entries 
occur in at most K non-overlapping clusters. This is markedly 
different from the conventional sparsity assumption in CS, 
where the individual non-zero components are independently 

A previous version of tliis paper was publislied in IEEE ITW'll, Paraty 
Brazil, October 16-20, 2011. There was. however, a mistake in the main result 
and proof, albeit the main conclusion given in Corollary^ of the paper was 
still correct. The results and proofs have been connected in this arXiv version. 



and uniformly distributed over x. Given the K block sparse 
source, we study the minimum mean square error (MMSE) es- 
timation of X, assuming full knowledge of the statistics of the 
input X and the noise n. Albeit this is an optimistic scenario 
for practical CS problems, it provides a lower bound on the 
MSE for any other reconstruction method. Also, knowing the 
benefits of having the statistics of the system at the estimator 
gives a hint how much the sub-optimum blind schemes could 
improve if they were to learn the parameters of the problem. 

The main result of the paper is the closed-form MMSE for 
the CS of block sparse signals. The solution turns out to be of 
a particularly simple form, namely, the Tse-Hanly formula |j7| 
where the system load and MSE are scaled by parameters that 
depend on the sparsity pattern of the source. This is found 
to be equal to the MSE obtained by a genie-aided MMSE 
estimator that is informed in advance the locations of the non- 
zero blocks. The result implies that if the statistics of the block 
sparse CS problem are known, the MMSE is independent of 
the knowledge about the positions of the non-vanishing blocks. 

Finally, we remark that the analysis in the paper are obtained 
via the replica method (RM) from statistical physics. Albeit the 
RM is non-rigorous, it has been used with great success for the 
large system analysis of, e.g., multi-antenna systems ||8J,J9J, 
code division multiple access |[T0[, 1 1 1 1, vector precoding p2). 



iterative receivers |13 | and compressed sensing p4)-p7l. The 
main difference here compared to ||9j-|[l7) is that the elements 
of the K block sparse vector x are neither independent nor 
identically distributed. This requires a slight modification to 
the standard replica treatment, as detailed in the Appendix. 



A. Notation 

The probability density function (PDF) of a random vector 
(RV) X E M.^ (assuming one exists) is written as p{x) and 
conditional densities are denoted p{x \ ■ ■ ■). The related PDFs 
postulated by the estimator are denoted q{x) and q{x \ ■ ■ ■), 
respectively. For further discussion on the so-called gener- 
alized posterior mean estimation using true and postulated 
probabilities, see for example, pO| , pT) , p3j . We denote 
y ^ p{y) = 9N{y I S) for a RV y that is drawn according 
to the TV-dimensional Gaussian density gN{y \ fJ,; S) with 
mean E M.^ and covariance S E M^^^. For a vector x 
that is drawn according to a Gaussian mixture density, we have 



2 



(2) 



where the density parameters ojr satisfy J2r=i ^'r = ^ 
> for all r = 1, . . . 

We write 1q e M'^ for the all-ones vector having Q 
elements, and given vectors {dy. e the diagonal 

matrix D = diag(di, . . . , d^) e ^NRy.NR j^^^ yg^j^^. 

■ ■ ■ <i]j]^ e on the main diagonal and zeros 

elsewhere. Superscript ^ denotes the transpose of a matrix. 



II. System Model 

A. Block Sparsity 

Let N = QR, where Q, R are positive integers, be the 
length of the sparse vector x. Furthermore, let x be composed 
of R equal length sub-vectors {Xr}^^i, that is. 



Xi 



where Xr 



Xr 1 



Instead of considering strict block sparsity where some of the 
sub-vectors {aJ^I^Li exactly equal to zero ||3|-|,6J, we let 
X be drawn from the Gaussian mixture density 



Pix) = ^^u}k,igN{x I 0; Dk,i), 

fc=i 1=1 

where K is an integer. 



L 



k 



1=1 



and 



K 

E 

fc=i 



^k 



(4) 



(5) 



Here ajj. denotes the probability of observing k information 
bearing blocks in a realization of x, the number of 
combinations how such k blocks can be arranged in x, and 
Wfc,; > the probabilities related to these arrangements. In 
the following, the input of ([TJ in the event that x is drawn 
according to the (fc,Z)th mixture density is denoted Xk^i- 

To impose a block sparse structure on the vector of interest, 
the diagonal covariance matrices in Q 

Dfc , = diag(df 



'fej = uia,g^U;^ , . . . , d^'''-'), l^l,...,Lk, (6) 

are required to be distinct for all {k, I) ^ {k', I') and taking 
only two values on the block diagonals 

di'^'^ = lo5^ and dl'^'' 



1, 



1 ^2 



Vr, fc, I. 



(7) 



Here 5'^ and cr^ represent the expected signal powers of the 
sparsity inducing and information bearing components, respec- 
tively. With these assumptions, the per-component variance 



1 



Mx 



1 



k.l\ 



tr(D 



R-k 



k.l 



(8) 



N " ■ " N ' ■ ' R R 
is independent of the permutation index I — 1, . . . , L/^. 

Definition 1. Let ct^ = 1 and assume that x is a RV with 
density Q. We say x is an approximately K block sparse 
RV if (5^ < and all matrices {Dk.i} satisfy (|6]l - ([HJ. If 
6^ — > 0+, we simply say that x is a K block sparse RV. (} 



B. Posterior Mean Estimation 

Let X be drawn according to (4]) and assume that we observe 
the noisy measurements y in (TJ. We assume that the noise 
n ^ ghiin I 0; o^Im) is Gaussian and independent of the 
signal X and the measurement matrix A. Furthermore, we let 
A be independent of x with independent identically distributed 
(IID) Gaussian entries with zero mean and variance 1/Af. 

Given the above assumptions, let 



y = Ax + n E 



(9) 



be the signal model postulated by the estimator We assign 
the postulated densities q{x) and (/(n) to the input and 
noise vectors, respectively. In the following, q{y \ A,x) = 
liv ^ y \ A,x) means that the realizations of the postulated 
measurement vectors y match the outputs y of the true signal 
model ([TJ, but it can be that the input q{x) ^ p{x) and 
noise q{n) ^ p{n) statistics are mismatched. If we define 
an expectation operator 



(3) ((•••»,^ 



q{x I y,A)dx = 



q[y I A,x)q{x)_^^^ 



q{y I A) 



(10) 



a (mismatched) MMSE estimate of x for the linear model ([TJ, 
given y, A, and q{y \ A^x), is then simply {{x))q. 

Lemma 1. The MMSE estimate of x for the signal model (Q 
reads {{x))p, i.e., q — p for all densities. 

Proof: The result follows by simple algebra and is omitted 
due to space constraints. ■ 

Proposition 1. The MMSE estimate of the K block sparse 
signal X with density (|4j, given noisy measurements ([TJ, reads 



fc=i 1=1 



^kjpk,i{y I A) 



p{y I A) 



Wk,i y, 



where 



Pk.Ay I A) ^ guiy I 0; ADk^iA" + a^I^), 
p{y I A) 



^^^k,iPkj{y I -4), 

k=l 1=1 



and 



W,.,_, = D,_iA'' (^AD,_,A'' 



(11) 

(12) 
(13) 

(14) 



Proof: Let A E M^^^ be a symmetric positive definite 
matrix. Then, using Gaussian integrals 



e2 



det{A) 



kb'A- 



''A-'b = 



xe 



"dx. 



(15) 
(16) 



where x,b E 

}-i 



dN 



27r^ 
dct(A) 

on Lemma [T| along with the identities 

I -U + VUyW = {I + UBVy\ (17) 
det (C^^ + UV^) = dct (/ + V^C^^U) det (C^^) , (18) 

where all matrices are assumed to be of proper size and B, C 
invertible, yields Proposition [T| ■ 
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Given the MMSE estimate of Proposition [T] we are now 
interested in computing the per-component MSE 



mse{<j')^E\\x-{{x))p\\yN, 



(19) 



when the dimensions of A grow large with fixed ratio /3 = 
N/M, and the number of blocks B = N/Q stays finite. The 
desired result is obtained in two steps: 1) the replica method 
is used in Sec. |III-A| to show that the original problem can 
be transformed to a set of simpler ones in the large system 
limit; 2) the solutions to the transformed problems are given 
in Sec. \m-B\ 

III. Main Results 

A. Equivalent AWGN Channel 

Let the indexes k = 1, . . . ,K and ^ = 1, . . . , Lj. be as in 
the previous section. Define a set of additive white Gaussian 
noise (AWGN) channels for all k and I 



N) 



(20) 



where Xk.i is a zero-mean Gaussian RV with covariance Dkj 
and ^k,i > 0. Let the events of observing the (fc. ^)th channel 
( pOj l be independent and occur with probability lu^.i for all k 
and /. Furthermore, let 



(...)(M)A 



/•••.(iMl^M)diM 



qizk,i Xk,i)q{xk.i) 
^ — dajfej, (21) 

be an expectation operator similar to ( [TOj i, but related to the 
{k, 0th AWGN channel (|20|. The MMSE estimate of Xk^i 
given the channel outputs Zk.i is then given by 



{xk,i)f-''^ = Dk,i {Dk,i + ^IiIn) ' Zk,i. 



(22) 



(k,i) 



We denote the per-component MSE of the estimates {xkj)p 



rnseiY\ek,i) = HW^kjf - E\\{Ak,iy^''m'}/N, (23) 

where the expectation is w.r.t the joint distribution of all vari- 
ables associated with (|20]l. The per-component MSE averaged 
over the realizations of the channels (|20ll is thus 



K Lk 

mseeq(U,lz}) = 5]5]c.,.,mse(q^'')(42_,). (24) 

k=l 1=1 

Claim 1. Let x be (approximately) K block sparse as given 
in Definition [7] In the large system limit when M, N,Q ^ oo 
with finite and fixed ratios j3 = N/M and R — N/Q, 

mse(cr2) ^ mseeq({Cfe,/}), (25) 

where mse(cr^) is given in ([19]) and mseeq({Cfe /}) < p4l ). The 
noise variance ; > 0, on the other hand, is the solution to 
the fixed point equation 



ik,l 



a2 + /3mse(^^')(^^ )^ 



(26) 



for all k and I. 



Proof: The proof is based on the replica method (see, e.g., 
[[Sj-pT) for similar results in communication theory and signal 



processing) from statistical physics. The main difference to the 
standard approach is that here the elements of the input vector 
x are neither independent nor identically distributed. Thus, 



the decoupling result proved for the CDMA systems 1 1 1 1 
cannot be straightforwardly extended to our case. Alternative 
derivation is sketched below and in part in the Appendix. 

To start, let us define a modified partition function related 
to the posterior mean estimator ([TO]l as p9) 

Z{y, A, A) - j e^'^'qiy \ A, x)q{x)dx, (27) 

where A e is a constant vector The posterior mean 
estimator of x given in ( fTO] ) can then be written as the gradient 
with respect to (w.r.t) A at A = of the free energy, i.e.. 



{{x)), = WxlogZ{y,A,\) 



(28) 



Similarly, if ([28]) is the optimum MMSE estimate, that is q 
p, and we denote the free energy density 

1 



/(y,A,A) = -logZ(y,A,A) 
the average per-component MMSE is given by 
mse = tr(E{vL/(y,A,A)|^^J 



(29) 



(30) 

where the expectations are w.rt. the joint density of {y,A). 
Unlike in fT9], however, direct computation of the free energy 
(density) is not possible here. We thus resort to the non- 
rigorous RM to calculate ( [29] l and then use the relation ( |30[ ) to 
obtain the final result. The details are given in the Appendix. 



B. Performance of MMSE Estimation of Block Sparse Signals 

Claim [T] asserts that the MSE of the estimator ( [TT] ) in the 
original setting ([T]) can be obtained in the large system limit 
from ( [24] i. Given the Claim [Tj holds, a bit of algebra gives the 
following proposition. 



Proposition 2. Let a1 = 1, S/l i > and N 

per-component MSE (|23|l is given by 



oo. Then the 



k ek 



k,l 



R-k SX 



2f2 



k,l 



k,U 



Re 



k,l 



R 



kU 



5^' 



(31) 



where j is the solution of \26) . When the source is strictly 
block sparse, that is 5^ — !■ 0+, 

ek ''="^ \{-l + Pk+<T^ + v/4a2 + (-l + /3fc + a2)2), 

(32) 

where = j = 1, . . . , L^. and we denoted (3k — 'JiP faf 
notational convenience. Thus, given Claim^holds, the MMSE 
of the block sparse system is given by 



K 

mse(o-^) = y^Wfc 



ek 



k=l 



Rek 



1' 



(33) 



in the large system limit. 



Remark 1. Note that the MSE is independent of the distribu- 



tion {ijJk,i}i ' that makes up ujk (see ^). 
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13 = 10: a' = 0-1 



g = 5.ct'= am.K = i 



Number of non-zero blocks R 



Fig. 1. Per-component MMSE for the CS of block sparse signal. Solid lines 
are obtained from Proposition [5] with 5^ — > 0"'' and markers depict numerical 
Monte Carlo simulations where cr^ = 1, 5^ = 10"*^ and = 1200. 

Remark 2. As 0+, the noise variance ( (32] l becomes 

the Tse-Hanly solution p) for equal power users but with a 
modified user load I3k = ^J3. The same noise variance is 
obtained by a genie-aided MMSE receiver, conditioned on the 
event that x is sampled from one of the mixtures indexed 
by k. The MSE ( |33| l, on the other hand, is a summation of the 
related MSEs but weighted with the probability of having k 
non-zero blocks in a realization of the source vector x. Thus, 
there is no loss in not knowing the positions of the zero blocks 
in advance if we use the optimum MMSE receiver for very 
large K block sparse systems. Note that the equivalent AWGN 



channel model in Sec. III-A already implies this point. For 



practical settings with finite sized sensing matrices, however, 
this does not strictly hold. (} 

Corollary 1. The MMSE estimator ( [TT] ) has the same MSE in 
the large system limit as a genie-aided MMSE estimator that 
knows in advance the positions of the non-zero blocks in x. 

To empirically verify the analytical results, we have plotted 
in Fig. [T] the MSE of estimator ( [TT| , obtained via computer 
simulations. The theoretical MSE given in Proposition |2] is 
given as well. In all simulation cases we have set Wk.i = uj = 
so that ujk = i^Lk. For the selected cases the 
theory matches Monte Carlo simulations very well. 

IV. Conclusions 

Minimum mean square error estimation of block sparse 
signals from noisy linear measurements was considered. The 
main result of the paper is the closed-form MMSE for the CS 
of such signals. The solution turned out to be of a particularly 
simple form, namely, the Tse-Hanly formula with a scaling by 
parameters that depend on the sparsity pattern of the source. 
The result implies that if the statistics of the block sparse 
CS problem are known, the MMSE is independent of the 
knowledge about the positions of the non-vanishing blocks. 

Appendix 
Derivation of Claim[T] 

Let us assume that the free energy density (|29| is self- 
averaging w.rt. the quenched randomness (A, y) in the large 



system limit iV — > oo. Then (|29| can be written as 

/ = lim 1 lim ^ log E^,^{Z(y, A, A)n, 



(34) 



where m is a real parameter. The replica trick consists of 
treating u as an integer while calculating the expectations, but 
taking the limit as if u was real valued outside the expectation. 
The second step is to exchange the limits and write the 
power of u inside the expectation using the set of 
replicated random vectors, resulting to. 



/rm = lim ^ lim ^ log ^j^^ (A) 



(35) 



where x^a are IID with density p{x) and 



l"^ a=l 



p{V I A,X^a\)Axya\ 



(36) 

Unfortunately, these steps are non-rigorous and there is no 
general proof yet under which conditions /rm equals /. For 
more discussion and details, see, e.g., ||8)-|[T3|. 

Let a;[o] be the true vector of interest, independent of 
{^[a]}a=i ^'^'^ distributed as x. Plugging y = AcC[o] + n 
to ([36|, the average over the additive noise vector n ^ 
gM{n I 0; o^Im) can be assessed using ([TSj. Furthermore, 
recalling that the true and postulated source vectors have 
GM densities (|4li, we obtain (|37ji at the top of the next 



page, 

X\r,l ' 



where E 







"{a=[al} 

9N{x\a\ I 0; Dk,i),a = 0, 1 



denotes expectation over the vectors 
, u. Next, let 



(38) 



be a RV composed of u + 1 sub- vectors Va — P ^^^Ax[a] G 
M*^. Also denote Q^ i = Qj;."' Im where q["] e 



]g(«+i)x(«+i) ^jjj jjjg (^a,b)th element of Q^."^ is given by 

Qkf = ^Ja]^[b]/^ where X[a],X[b] - gNix \ 0; Dk,i) for 
all a, 6 = 0, 1, ... , u. Then, ( [37] l can be written in the form 



(27ra^)- 
u+1 



k=l 1=1 



xESt^iilWH^Io^Q.^e-'"'^"}}, (39) 

where S = (/S/ct^) [7„ - 1„iT/(1 + u)] e k(«+i)x(«+i)^ 
Using ([TSj to integrate over the Gaussian RV v in ( [39] l yields 



4") (A) 



K Lfc 



k=l 1=1 

where 



.]} 



n 



(27rCT2 



[l + u) det(J + SQ^'; 



(40) 



(41) 



To compute the expectations w.rt., {a;[a]}a=o ^'^^ ^ ~ 
1, . . . ,K,l — 1, . . . ,Lh in (pOb, we write the measure of the 
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(A) 



_ (27rcr2)" 



K Lfc 

EE 

k=l 1=1 



{a=[a]} 



exp 



2cr2(u + 1) 



E "ii^^w ir - E E(^^m)"^(^^[''i) 

a— 



(37) 



matrix Q^,"'' as 



I a=l a<6 J 

and integrate w.rt. (|42]|. Writing the Dirac measures in (|42]| 
in terms of (inverse) Laplace transform and invoking saddle 
point integration (see |12, Appendix A] for details), we get 

4"H^)=EE^Me^^-^^\ (43) 

fc=i 1=1 

where G is a symmetric matrix. To obtain 

( |43| l, we defined an auxiliary function 

- inf {tr(Q2Qg)- lim iV"! log^i"/(Qi::/, A; iV)} 



where 



{==[al} 



(44) 



(45) 



denotes the moment generating function (MGF) of ( |42] l. We 

(n) 



also wrote Xn = [a::[o] 



g where n = 



(r — 1)Q + g for r = 1, . . . ,R and q — I, . . . ,Q in the 
notation of ([3]l. 

To make the optimization problems in (|44]) tractable, we 
assume that their solutions are the replica symmetric (RS) ma- 
trices (see, e.g., p)-p3) on discussion about this assumption) 

Qld = {Pk,i - qk,i)Iu+i + qkAu+i'^l+ii (46) 
Q*k,i = iPk,i - qk.i)Iu+i + (47) 

respectively, where pj. qk.i,Pk.i, Qk.i are real parameters. Un- 
der the RS assumption, we get the simplifications 

^<Qk}Qt}) = (" + ^){Pk,lPk,l + uqk,iqk,l) (48) 



and 



^0, 



(49) 



> 0. 



^\og{2na')~-\og{u+l) (50) 

(51) 



From the first extremum in (|44]) one obtains pk.i = and 



qk,i = [o-^ + ;5(pfc,; - qk.i)] 



(52) 



where pj. ; and q^ i are left as arbitrary but fixed parameters for 
now. To proceed with the second optimization problem in ( |44] l, 
we need to evaluate the MGF (|45| l under the RS assumption. 

Using ([TSj right-to-left in ( |45[ ), using the RS assumption 
( |46l ) - ( |47] l and recalling that the replicas {a;[a]}"^Q are IID 
yields after some algebra 

^gfe,, A;iV) = d^\qk,l) I Ei'^'') I x; q-jl^)] 



d2 



kJi 



(53) 



where the expectations E^*^-') are w.r.t. zero-mean Gaus- 
sian RVs with covariance Dk^- The normalization factor 

cl^\qk,i) — [(1 + u)(27r/gfc,i)"]^^^ is due the inti'oduction 
of the Gaussian densities in (fSSll. Since 



1, 



(54) 

(55) 
(56) 



the second optimization in ( |44] i reduces to the conditions 

N^oo 

q,,^ lim 7V-iE('=-')|||(i)('=^')||n, 

when A = and m — > 0. The expectations in ( |55] l and ( |56l ) 
are w.rt 2:^, and the independent zero-mean Gaussian RVs 
X, X with covariance Dk,i as in ( |53] l. We also write 

PN{zk,i) = ^'i'^\gN{zk,i I x; q-^jlN)}, (57) 

so that 

1 

P{Zkj) ' 

Note that (fSHll is the MMSE estimator of the Gaussian channel 



{gNizk,i I x; q^^I^)}, 
-Ei^^'^xgNizkA I qk}lN)}- (58) 



2fej=a; + r,fc_,eM^, (59) 

when the receiver knows the correct distributions of rjf. ; ^ 
57v(^7 I 0; q'^]lfq) and a; ^ gAr(a; | 0; Dk,i)- Furthermore, 
from ( |55] l and (|56| we get 



lim l[E(^-'){||a=|p}-E('^^'Hll«''^lP}] 



P/£,i — <Zfc,Z, 



(60) 



where msegq''^ is the MMSE of the Gaussian channel ( [59) l. 
The free energy density under the RS assumption reads 



frm-rs = lim ^ Km 



^i°4ee 

\ fc=i 1=1 



uJk,ie 



AfT<"'(A) 



(61) 



Switching the order of the limits once more yields 



• k=l 1=1 



r if Lk 



EE^*^-^' 

k=l 1=1 



,JVT<")(A) 



(62) 
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Since 



ri?(A)^0, 



(63) 



by (|5]) the denominator becomes just unity and can be omitted. 
For the latter part, 



(64) 



where the derivative is assessed 

2/3 

1 . 1 



+ ^ [1 + log(27r/gfe,0] - ^ log(2^ea2) 
1 



+ lim — pN{zk,i)hNizk.h\qk,i)dzk,i. (65) 

N^oo I\ J 

Note that we defined above the function 

hN{Zk,U^, qk.l) 

= log (Ei'=^"{e^'V(^fc,; I &n))), (66) 

for notational convenience. Recalling ( |63| ), we finally have the 
RS free energy density 



K L 



+ 9 X! X! '^fc,/ [log(27r/gfe^/) + loggfej - 2qkjqk,i] 



k=l 1=1 



K L 



+ lim —y^^y^^uJk,i PNizk.i)hNizk.^i,X,qk.i)dzk.i, (67) 

where only the last term depends on A and is relevant for the 
assessment of the MSB, as given in ([30|. 

The final task is to compute \/\^hjq{z, '^7^^)|_^_q- First, 



^\hN{Zk,U \ik,. 



(68) 

so that the estimator ( (58] l can also be written as 

{x)l^'''> ^VxhN{zk,i,\,qk,i)\^^o- ^69) 
Proceeding similarly, after a bit of algebra we obtain the 
conditional covariance matrix of the error 



Ep'\zkj)^\7lhN{zk.i,\qkA) 



A=0 



which is also the error covariance of the estimator (|58|. 
Thus, by (|30]l, the per-component MSB of the original MMSB 
estimator given in Proposition [T] reads 



mse 



k=i 1=1 V'' 



PNizk^i)E']l^'^\zk,i)dzk,i 
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m and d60b as 



which can be written due to 



k=l 1=1 



completing the proof. 
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